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1. INTRODUCTION 

This paper presents a formal definition of the scan algorithm of the TRACt 
languages The definition does not extend to a complete processor for the 

language since the definition of an evaluator for expressions is not given« 
This paper provides a rigorous starting point from which a complete pro¬ 
cessor could be constructed* 

1.1 Background of the language® 

TRAC language was developed for Interactive text or string processing. ’ Its 
basic structure is such that it can be implemented on small as well as large 
computers. The defining report for the language is [!]• Additional explan¬ 
atory information is found in [2]* TRAC language has certain features not■ 
found in many other languages because of a dynamic relationship between the 
scanning of the text of a program! and the evaluation of an expression once 
it has been successfully parsed. The essence of this relationship is given 
in figure, 1 . 



Fig. 1. Simplified state diagram of a TRAC language processor. 

The scanner proceeds from left to right. Prior to evaluation only a suffi¬ 
cient portion of the source text is proce^^sed to complete the parsing of 
the Innermost expression. At this point control is transferred to the eval- 

t Trade mark and service mar!: of the Rockford Research Institute Inc« 

! An input string is called a script rather than a program since in TRAC 
^ language both source progriim and data are intermixed on the same input. 
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uator® When_evaluation is complete^ side effects if any will have been fe-- 
corded^ the source text will have been altered^ and control is transferred''■' 
back tO' the scanner* From this point the process repeats* Inherent within 
this processor is the ability to execute functions'recursively• The des¬ 
cription of the scan algorithm must include a means of indicating a branch 
to the evaluator and subsequent return to the scanner* The scanner itself 
must provide the ability not only to parse the source text but, also to,, ■trahs- 
late the text as necessary in preparation for expression evaluation. This" 
separation of scanning and evaluating is similar to the ideas of inner and 
outer syntax as defined in [3]« 

1,2 An example of the TRAC language processor« 

Figure 2 shows the relationship between the two major■strings which provide 
the environment for the TRAC processor. The iieutral/active string is regarded 
as residing on a pair of unbounded pushdoim tapes, where the boundary between 
the tapes is indicated by the scan pointer. 



able for those not familiar with [1], In its initial configuration, the 

neutral/active string of the TRAC processor contains —4 0//(ps,#(rs)) 1— , 

n a 

In this configuration, ^ is the left end marker for the neutral string;- 

D marks the boundary between‘"the neutral and the active string; I— is the ■ 

a 

right end marker for the active string. The symbol D is called the scan 
pointer . For this examples primitive functions will be enclosed by H and ). 
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The operation of the processor is as follows: 

(1) Read-string: 

The innermost primitive expression of the initial configuration.^ is found by 
movement of the sxan pointer. This expression is #(rs) which is the read- 
string or read-ln function* This expression is then evaluated and causes 
an input string to be read from, the typewriter keyboard or other input device. 
Let the input string be #(adj5,7)® . Mthin this stringy the apostrophe ^ 
is called the metacharacter^ and it signifies the temination of the input 
.strings As a result of evaluatJonthe expression #(rs) is repalced by the 
input string, #(ad^5,7) and the mxtachara-cter Is omitted®. 

(2) Add: 

Mow the innermost expressi.on Is #{3d^5^7) • - la its evaluation^ it is replaced 
by Its value 12, which is the sum of the arguments 5 and 7« 

(3) Print-string: 

Now the innermost expression is #(ps,12) where ps stands for the print- 
string primitive function* When evaluated, this expression prints the value 
12« Since there are no further fyiiclicdis to evaluate, the processor reloads 
.itself with the initial configurationo 

1,3 Goals in the formal definition of a language* 

The goal of formal definition of a programming language is primarily to 
achieve mathematical precision in the statements which describe the language* 
Its main value is in man to man ■■commuriicatioii about the syntax and semantics 
of a language. This communication has value both for implementors and users 
of the language^ Since it is difficult to achieve the desired precision with 
natural language, other special languages are defined aod used as vehicles 
for ..definition® This was done for FL/I^ see [4], Because of the nature, of 
TM.C language, a new grammar mocielj called a type-T grammart, has been devel¬ 
oped as a vehicle for the definition of the scan algorithm* Care has" been 
taken throughout this presentation to show the relationship between, the, pre- 
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sent definition and that given in [1]« The present definition is intended 
to be precise^ compact^g complete and independent of the space and speed• 
performance of specific implementations of the algorithm* 


t A type-T grammar is an adaption of a type~"0 grammar* This adaptation allows 
a type-T rule when invoked to preserve a terminal string during the evalu¬ 
ation of the rule even when this terminal string is specified in the rule 
by means of a context-free category* Type-T rules are defined in full in 
section 3. below* “ 

The following conventions about phrase structure languages will,be assumed 
throughout* For a having as Its range^, any set of strings^ them, 

^ ■ “f- 

{a} - {a} u {e} where e is the empty stringy and where 

• 00 i 1 1^.4.,^ 1 

taj - u ^ where a =a and a -a a * ^ 

, i=i 

■Let string be an element in the set {category u terminal) * A type-0 
grammar has rules of the form ”string-l —> string-2**, „ For a type-1 grammar 
the length of string-2 must not-be less than the length of string-1* In 
type-2 (i*e* context-free) grammar rulesstring-l is restricted to a cat¬ 
egory* 

If string-2 —> string-4 is a rule^ then the application of this rule to 
the instantaneous description ”string-l string-2 string-3** produces the 
result **string-l string-4 string-3*** This operation is called a generation 
step and is denoted by: 

string-l string-2 string-3 string-1 string-4 string-3 , 

where ===> is a transitive relation between a pair of instantaneous descrip¬ 
tions* A generation sequence of n-1 steps between n instantaneous descrip- 
(ID*s) is denoted by ID-1 ^ ID-n where ^ indicates the intervening seq¬ 
uence of steps* A grammar defines a language_as the set of all terminal 
strings which can be produced by applying its rules when the initial con¬ 
figuration for each generation sequence is the sentence symbol * 

A model based on a type-0 grammar was used in [5^6] to define a variety 
of context-free parsers* A form of type-0 grammar was used in [7], to par¬ 
tially define the TRAC scan algorithm* Complete definitions of the 0,1 
and 2 grammar types can be found in [8] on page 15* A type-0 grammar- 
model is equivalent to the reduction language of Floyd as reported in [9] 
section II.B, Introduction and subsection 5* 


1.4 .Methodology of formal definition* 

A definition consists of a statement, or sequence of statements, written 
in a meta-language about some object being defined. A non-trivial meta¬ 
language must be capable-of defining an infinite class of objects* The 
meta-language must include the elements of the object language as a proper 
subset of its elements and must have other elements at the meta level which 
are syntactically independent of the object language, otherwise definition 
would not be possible. A definition may permit a recursive reference to 
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.an object being .defined^ but this recursion must be within a level of defin-- 
ition and not across levels of definition^ In the process of definition, there 
arises naturally a hierachy of language levels^ with the higher level standing 
"in a laeta relationship to the one Immediately below it» The highest rneta*™ 
...language must be natural language* This paper will utilize three levels* 

The first or top'level is English which is used to define the second level 
languages which are the context^'free giamiiiais and the type-T granmiars* These 
gramimars in turn are used to define the mapping rules between successive con- 
figurations of the neutral/active string. The set of these m,apping rules con- 
titute .the scan algorithin which is the third or target level of definition. 

1^5 Non-procedural algorithms. 

The term rule will be used to denote, a conditional statement and an associated 
action which is to be performed when the condition is satisfied.. Such a set 
of rules is usually tested for satisfaction of the conditions in a fixed se- 
' quence. This case appears in figure 3 under the title ''Sequential Evaluation”. 
Departures from this fixed sequence could be provided for by the use of a goto 
cause a branch to a rule which is not the next one in the sequence* However a 
non-procedural algorithin consists of a set of rules which can be evaluated in 
any sequence* Conceptually such a set of rules should be evaluated in parallel 
to see if the, conditions for more than one rule are satisfied. This case appears 
in figure 3 under the title “Parallel Evaluation”* If more than one rule can 
be satisfied for any configuration of the systemg then the set of roles is non- 
deterministic . On the other hand. If there is at most one rule which can be 
satisfied for every configuration of the systeEi, then the set of rules is det¬ 
erministic * If.an algorithm.of n rules Is both deterministic and non-procedural, 
then the rules can be ordered in iil different sequences all of which are logic¬ 
ally equivalent when the rules are tested for satisfaction in a fixed sequence. 



Parallel Evaluation 



Fig. 3. Alternative methods of 





evaluation for non-prbcedural algorithms 
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1*6 Relationship between a non-^procedural 'algorithm and a graminar* 

In order to explain, the relationf^hig between a non-procedural algorithm and 
a grammar it is necessary tc introduce the concept of an .a]i s.tract machine 
hawing a finite control and a fixec number of auxiliary storage tapes* This 
abstract machine has a set of rules which define its operation and comprise 
its finite control. If the macliiiie has only one state^, each rule when eval¬ 
uated returns control barb to the one contrelllng r:tate* Fi.'cni the discussion 
given in 1.5 above^ it rbeuld he evident that a one-state machine is equiva¬ 
lent to a non-'-procedural algorillm;* 

A coBventional grantiiar is eqiiivc'leat t:o a one-state macliine which defines a 
mapping from a fixed initial cunfigararion Into a set of teiminal strings* 

Tills fixed initial configuraexon consists of a distinguished category called 
the sent ence symbol * An exte nded ,gramm^^ will be defined as a .graBimar for 
which the initial configuration Js not fixed but is allowed to range over a 
a set of strings composed of o scan pointer am! terminals® Processing x^ill be 
understood to halt when no rale can be applied. An extended grammar then de¬ 
fines a mapping from one set of temiiiai strings into another. Depending on 
the granEiar^ this mapping may be either a partial or a total function. 

If the rules of a non-procedural algorithm define a mapping from one set of 
strings to another, then for every such algorithm there exists a directly equ¬ 
ivalent extended grammar., the reti-xlnder of the paper^ the term grammar 

is to be understood in the more specific sense of extended grammar® 

1.7 Definition problems peculiar to the TRAC class of languages* 

Since by definition TRAC Icinguaga oxxepts all strings as legal inputs a 
graramar defining merely legal strings is trivial, A more complex grammar 
could define a parse of the entire input script without considering the effects 
which interactive execution has in altering the script. However^ such a 
parse would actually be erroneous for mmy ron-trivial cases« These factors 




indicate that a context-free grammar of TRAC language scripts would not be a 
meaningful definitional device. Also, direct extensions of context-free nota¬ 
tion such as table grammars [10] are excluded and for the same reasons. The 
stages of redefinition of the TRAC language to be carried out in this paper 
are given in figure 3, The ocntext-free categories are defined first. These 
categories are then utilized to provide a non-procedural restatement of the 
original algorithm (Section 2,), This algorithm is subsequently restated as 
a type~T grammar (Section 4.)^ and as a decision table and its flow chart 
(Section 6.). 

1.8 The form of context-free rules. " , 

Certain context-free categories are used in the statement of the scan algorithm. 
Each of these categories is defined by a composite rule of the form; 

category expression-1 

m 

. ^ ^ « 

::= expression-n . 

A composite rule which has n rightside expressions is said to be composed of 
^ prlTaitive rules each of the form; 

category ;;= expression-i . 

Each expression- i is a string consisting either of the symbol j; or of an arb¬ 
itrary number of either categories or terminals. The symbol e stands for 
the string of length zero , also called the empty string . Each category ap¬ 
pearing within a rule has the form; 

. < X > 

where X stands for any string composed of letters and blanks. 

1.9 Terminals and undefined categories. 

With no loss of rigor, the categories <digit>, <letter>, <format character> 
and <user delimiter> will remain formally undefined. They are to be understood 




(UTILIZED WITHIN THE 
NEUTRAL/ACTIVE STRING) 


(15 PROCEDURAL RULES) 




Fig. 4. Outline of the u'-iages of redefinitica of TRAC language scan algorithm. 
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as disjoint categories each of Which has as its range a set of terminals . 

The category <dlgit> inciudtjs tl s ten numeric digits as terminals. The cat¬ 
egory <letter> includes the alphabetic characters (either upper case or lower 
case or both) as terminals’. The category <user dellmlter> is to be under¬ 
stood to include period and space and ; and ; and other special characters, 
but specifically to exclude comma and ( and ) and #. There are two additional 
terminals which are not part of the TRAC language but which are necessary for 
complete precision in formal definition. These terminals are ~1 and I— which 
are respectively the left end marker for the neutral string and the right end 
marker for the active strinj:.. Fote that the symbol called the metacharacter 
is not a context-free category since it is a terminator of an input string 
and as such does not appear in the defloition of the scan algorithm. 

the context-free categories. 

#(ps,#(rs)) 

# ^ . ■■ 

<digit> 

<Ietter> ' 

<user delimlter> 

e ■ 

( <balanced stfing> ) 

<balanced string> <balanced string> 

<text charac.ter> . . ; 

<format character> 

( <unbalanced part> 

c 

<balancad strlng>, , , 

<unbalanced part> <unbalanced part> 


1.10 Rules which define 
<idllng procedure> 

<text character> 


<balanced string> 


<unbalanced strlng> 
<unbalanced part> t ;= 



<script character> 


(■ ; 

„) ■ 

j*s <teT:t aharaater^ 
i:- <foi:iaat c;:lia£acter> 


'irgam€::nt> „ s r- e 

:..s:- <script chara.cter> ...^..... . ;. 

<::3crlpt chara.cter> C,ii-giiL.icrit> 

?xgiifiient seqiience> ;:= <argiiTiep.t> 

sr-’ ^ \rsigumeut seqiieiice> ■ . 

,11 s":ri£ij:;* 

■ -"I Jte rbat an. '^.imhaJannp.d Is defined to an ei-xesii of 

■•’Caien parens® Also sucb a al%\®ys begins wif± { « The case ’’i£ a 

a-string with an, excess of close c/ia ocaar^ hut it is handled without 

the need for a special category to Bame the substring which is involved. 


1®12 categories® 

There are no special categories to name and define the neutral string and the 
active string® These omissions are intentional* The schematic relationship 
between these strings and the scan pointer and the respective end markers is 
given ill figure 2® A context-free definition of these strings is not meaning- 
fill because interaction hetween tiie scan algorithm and the evaluator 
dy'Eaixiicc;llj alters tlie contenus of both strings, 

1.13 Representation of a grammar as a syntax graph® 

The context-free graiiimar rules of i.lO can be displayed as a syntax graph* 

Such a graph appears in 6® Ine raies by which such a graph can be 

constructed are in 5 where fi\fc specific cases aie rieoiized. 

The explanation of these five caanr ia as /t.dl-Fns; 

(1) A category J.n m grammar dofinnd by a 'cnrpoj: t:a rvla,, comicsad of i\ pcim- 
itlve rules'is representnb in fb*« fcy an encJosiiag the cat-* 

egory name %ylth < and.> ociitted and mith n directed arcs leading away from 
the elipse^ one arc for i*acli primitive rule^ 
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(2) When the rightside of a primitive rule Is a tenninalj then the terminal 
appears in the graph unchanged and with no arc leading away from it# 

(3) When a primitive role consists of simply of a category on the leftside 
and a category on the rightsidej then the rule is represented by a single 
directed arc linking two elipses^ 

X4) When the rightside of a primitive rule is a string of more than one symbol 
then the arc will point to a horizontal brace which spans the graph rep-' 

; resentation of that rightside string# In this representation terminals 
stand for themselves but categories are replaced by a circle with an arc ■ 
leading out the the actual elipse where that category is defined. 

(5) If a category is defined recursively^ the arc indicating this is given 


as a broken line. 



Case 


Grammar Format 


Graph Format 


( 1 ) 


( 2 ) 

(3) 

(4) 


(5) 


x-1 , 
t Is 

Fig» : 


<abc> X'-l 

« 

» , ■ 

<abc;> t 


<abc> <''eat‘egjrj> 


<abc> <C':ategorY> t 


<abe> :.<catc:gor 7 > <abc> 


are. arbitrary ri.ghtsidciii of 
an arbitrary terminal* 



® Equivalences between giaamiar famac and graph format for the 
representation of context-free syntax rules* 
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2*0 A MON--PB.OCEIIlfBAl. STATS^IMT OF THE SCAl il.GOPJTBM 

In [1] tlie sc-an algorilLiii .i.i a ^c^jcedtiie consistxng of fifteen rules* 

These rules are in the fora of statetFerts: and are linked together , . 

ty gc>tri*3^ Bafore tliis aAgco\l.:liu can stated as a type*T graimiar it is 
necBSSRTf to convert this irroctecliirel Hefinttfoe fa’”, f i*! f sof cf Titijr-pro.iedaral 
xjlas fci no specific order of execution is required* The 

result of this conversiori * v' '1 1.9 ^^hare thc' Si\iv nlgorlclm is 

defined in the lots al rJf^t tnj'Mn rales. Appendix 1 iric-liides a table 

■showing the relationship hefiveen ^tlfo?sltlim and the cne given :n [!]• 

3 C i h L i/ It It j* T H ii 

‘The TIAC language scan algoritbii operates on strings contained between the 
end '^erlciF -“I mil f- rith C iuc ecu-in puniLer* Iiiis scan environment 

is SQOwn in txgure 2 . ■ 

2„1 If the ect'i-ve scrlag ia ajipty, £htij. f.c-.ifct'fc the neutral string, reload 

the <idling procedure^ and position 0 to the left of tha first . ^ 

character cf the <iviilag procedure!. . - 

2.2 When a ( is encountered imraediately to the right of. Uj .llfEeraat actions 

a'ce poasihle depending oa the context 2 : 

2.2.1 If there is a strln<> the form (fbalancrd etrfn^y) .maediately 

■to tbe rigtl of Q -tlievi the eacioslng p&iens oi this <balanced string! 
are removed and [f 1:? tao-voA to the right cf the <balmical string!. 

2.2.2 Tf tha cliax-aetti. j,u«; xl-^hi. oc 0 is ( and this i is the first char™ 

acter of an ^.unbaJ.ariccd then th'S xidJi.vg procedure! lx reloaded 

beca'i>3 o,ae ct aicre clo'.o par-aux d.ce missing. 

2.3 If the character on the light, of IJ li'. u <fo'-otiat character! It in delated. 

2.4 If the tT-JO ch.-ar3ct3rs tr An . 1^.,! . Ii >H » then they are replaced 

I avcl 0 Is mowed to it? ripMa. ; * , : _ 


by 
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2.5 If the three characters to the right of D » then they are 

replaced hy ( and Q ia 'EovnJ t> its right. • 

2.6.1 If the character to the right of D la # and if the character to the right 
of the # is neither # nor (j, then D is moved to the right of #. 

2.6.2 If the two characters to the right of Q rrs ## and if the character to 

their right is not ( ^ then [] is moved to the right of the leading # , 

2.7 If the character on the right of 0 is 'a coitma. It is marked as , and 

0 is moved to its right. 

2.8 If the character or, the right cf 0 1& ) » then a leftward scan of the 

neutral string wll3 detemine which of the following three alternatives 

id to apply, n itself is not to be moved during this, leftvrard scan. 

2.8.1 If the leftward scan encounters ( prior to { or ^ then the 
intervening string is the <arg'ument striiig> for the active 
function evaluation. The evaluator replaces (<arguinent string> Q ) 
with the <value> returned and positions Q to the left of this <value> 

2.8.2 If the leftward scan encounters X prior to ( or ^ then the inter- 
veneiiiiig string is the <argumeiit striiig> for neutral function 
e¥aliiatioii. The e'/aluator replaces ( <a.rgumeiit string> 0 ) with \ 

■ either a <siiccess or & default valiie> depending on whether 


it succeeds or failSt If the ei^aluatioii succeeds^ D'"is positioned 
' to the right of <siiccess ¥abje>< If the e¥3lmtioa fails, D is 
positioned to the left of the <default ¥aliie>. 

2,8.3^ If the leftward scan encounters —i prior to I or | then the clos 

n 

paren to the right cf 0 ialeledl. 

2.9 If the character on the right of 0 2 <text character>, then D is 

moved to the right of this <text character>. 



t This rule coiifonas to [i] but currenf; standard TRAC language processors . 

are iiDpieraented as follows; *’If tlie lefcwatd scan encounters *“1 prior to 
„ n ‘ 

{ or ~( then the active string and the neutral string are deleted, and 
the <,idling procedr.reh is reloaded wlUt 0 positioned on. its left 
■f- Since tlie cui-rent contents of the fnerutral £triag> tan never be evaluated, 
the processor vill be uox•^ efficient if the <neutraJ string> is also 
deleted by this rule, ■ 


iria Fon-procedursl aspects of lie al^aciclati. 

Note that the logical conditions trader which each cf the above nine rules 

Hsay he applied ere Ir. dsper.dai'it of the KC^jiueitce iii which previous successful 

rid.as verc applied. The rules are to be tmd.ftrstood as logically unordered 
and no rule refers to another rule by aieaiis of a goto statement. The rules 

also form a partitioned set in whicn the conditions for the successful eval- 

nation nf rule are d.is.ioint frou; the conditions for the success of every 

ether rule. In addition, the rules are coBiplete. since they provide an 

exhaustive cover for all possible cases of strings in the environment of 

figure 2. ' - 

This method of speciflcat3.on does not io general assure that theie will 

exist a direct dete.rT.inistic processor. Thar is a processor which can make 

the correct choice every time «.£houK backtracking. In general when this 

method is used, it may be possible that there exists some situation in which 

more than one rule may apply. Eoweitec, In section 6. below it will be shown 

that.this iion-procedural algorithm' is deteministie. The nine 

rules may be ordered In eny one of 11 differcut ways. In all of these pos- 

sibla orderings, the operation oJ: the algorithm .is identical. In an actual 

implementation an optimum orrlerir.g of .';he rules could oe obtained by st.atisti.cal 

consiilc'rrtio’is on the o: Lne execution of each rule, together with a 

cost factor for each rule based upon the space and speed characLeriscics of the 

target machine,. 
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3. DEFINITIOM OF A TIPE-T GUMMAli ^ 

In Section 4^, x^hicli follows.* the scan algorithm is presented as a set of 
thirteen type-^T gramsiar rules. The purpose of this section is to define and 
• explain the notation to be employed in the gramrciar rules of 4« These type-T 
rules are numbered (4.1 to 4*S) in one-to-one correspciideiice with the verbal 
.non-procedural statement of each rule given above (zil to 2,9)^ 

Type-T. rule schema. 

The Bclieaia for type-^T rules is given :ln figure 7» Unless stated otherwise 


leftside rightslde 


coiitext-I D coatext-2 


(T u C)^ 




eontext~3 Q coritext**4 



CT u K) 


Mote that the quantification given for the leftside has been simplified. 
Fig® 7® Schema for type-T rules® ■• ■ 

quantification is as given in figure 7* 

Notation: 


T is the set of teralnals* 

C is the.set of context-free categories® The exteiided-^class of categories 
given in 3«2 below are also context-free categories, 

K - {c|,C 2 » •ti® -which is defined for a particular rule as the set of 

categories appearing on the leftside of that rule* Thus figure 7 requires 
that the quantification of the "*rightside” be restricted to those categories 
which also appear in the corresponding ^leftside” of the same rule* 

V « T y this is the set of the total vocabulary* 


Quantification: 


The precise quantification of' **leftside” witiiin a type-T rule is not given in 


figure 7® This quantification is given, by: 

"leftside” Is in {VO u |]V - 

This added precision assure;; that, no leftside of a rule will be represented 
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by Q standing alone. 

Syntactic Restriction ; 

For a given rule and for .ia !(j then ? Synonyms will 

5 allov?ed for the context-free categories so that the restriction does not 
tually restrict the parer. of the lertsids. 
mantic Restriction ; 

the same category appears' on hutu the leftside and on the rightslde of-.a' 
ule, then the exact terminal string spanned by the leftside category Is pre- 
erved during f.he c.val'iatiD,a cf the rule. However the position of that string 
Ithin the context may have been al.at.ered in accordance with the. C|H 2 cificatlo 
iven by the rightside,. This is explained iu detail in 3,3 to h,'/ below. 
lema rk; 

rhare do not exist any type—X categories in thi.s graumar system.., All categ¬ 
ories must be defined by aome con.text-free graimiar. Xhls fact indicates the 
type of a acan which is nexessary to determi.ne .If the conditions of a rule 
are satisfied, 4 pt’shdcwi a-atoinatoa sill be powerful enough to carry out the 
required trial parse. An example of the type of parsing required by a type-T 
rule is given in figure 8. 

3.2 Extended-class of categories. 

In listing the rules vZ a. graaunar, it is sometimes much more compact to 
define a category by ennuraeratlng wbst is ret: in f:’ie category rather than 
by the usual method cf erincaaerating what is_‘in the category. For this 
purpose, the following defijaition’is given; 

{ list ]>*. i: ... { Use }, 

The expression { list } is to be replaced by specific terminals ir. flia form: 

. { terminal, j * . »' I f.ernlnil }. 

i ‘ ^ . [a 

For exsnsple thf r,r.fogo:-y means; "any symbol in E except if or (", 

When 'list*' is a single tramlnal the surroimdlng .f and } are omitted. 



The set of terminals (denoted by E) include the three undefined 

terminal categories given in above and the end markers® Thus, 

I = {<digit> I <letter> | <ijser deliiiiiter> | # | 

-t ( I ) I . I ( I ) i ;■ 1 i fx) . 


1) Tree Format 2 Q (O) 



a a a a a 


‘(2) Stratified Formats ■ for <balanced 3tring> 

i«- 

0 (aA(aaa)a) 


Fig. 8. Example of the parse produced to match the leftside of a type-T rule. 


3.3 The computation step def.ined by a type-T rule. 


A context“free language is so defined that its generation sequence will ter¬ 


minate. A type-T grammar as used hare does not define a terminating sequenccj 


but rather each rule defines a computation step in a potenially infinite se¬ 


quence, As in phrase structixr.-i languagas, a transitive step is denoted by => 

* 

and a sequence of steps by ^ ^ 




If [ja aO is a rule che.i bcQaci Jicai^ci ij a legal step which results 


from the application cf the rule. If cQa —> fQ is a rule then bcQad 


3«4 Context-free cutegorit's w.I£.hia a eype-T lale. 

Fissure 9 gives an cxar'ole of the '"'per-'tir't of a rape-l|rijie J which the same 


category appears on both the leftside and the rlghtside of a rule 


Praserviition of a uertalnal character under a c 
the operation .-if tee tnrefc sut>.iteps within a t 


a single .'atcggorj <x'. anf one tyre-T ru’-j H'-xV <:..>n 


tep is shown broken down into its three substeps. Note that 


throughout the mapping, the trnni<, al wa i presetf/ed vi.'v. not changed to a or 


to <x>) 3 althoi-fgh '■.his is not axp"* iciily -^tai-■>d hw the pe-T ruTe. The expian 


atlon of each substep is as follows 


This substep discovers a local parse which is a match for the category <x> 


This match provides satisfaction lor ?,lie leftside of the type-T rule, Th 


new perse Infcscmatfon .is .rv-or'-d c' th/ t’r ;;ee cattipor'y tnccks .abo-ia tije icr 


mlnal track 
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(2) Main substep “ s 

This substep carries out the operation indicated by the rule in accordance 
with the restriction that *’the terminal string under a category is pre¬ 
served", 

(3) Unparsing substep ”= : 

This substep deletes the parsing Information from the three category tracks, 
so that the. tracks will be empfcj for the application of the next rule. 

The number c£ categories in a finite grammar must be finite. Thus although 
the names identifying actual categories may be composed of several letters, it 
is always possible to recode the category names onto some finite alphabet in 
which each category is represented by a single unique letter. Hence for the 
examples of figure 9, 10 and II only single letter categories are employed. 

In an actual grammar such a restriction is unnecessary, 

3,5 A cattjgory which spans an arbitrary nuitibar of characters, 

A special utilization of the three "category tracks’* is required in the case 
where a category spans more than a single terminal character. An example of 
this case is given in figure 10, Note that < on the first category track 
marks the beginning of the span, and that > on the third category track marks 
the end of the span. The category name, "y" is repeated once for each term¬ 
inal in the string it spans. This method of processing is equivalen to hand¬ 
ling parsing trees by their nod names. Figure 11 gives a summary of track 
utilization. In this figure, each of the terminals within the span of the 
category <z> is marked by a z on track C2, The terminals under each z are 
preserved unaltered during the evaluation of the rule. 
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Cater^nry; <y> 

Context-free £ 


my 


laaaac I'aair-Cc 


Jf’ig. }0, A type-T £ teo vti'isjing a «'ategory which spans a string 


Fig., 11. Summary of the utilization of the category tracks 
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306 Case of tlie empty string* - ’ 

In the context-“free ■ grammar-given in I^IO above, cases occur where for 
some category <x>, <x> “> e. In this case there would be no place on the 
tape to write <x> over since r does not occupy a character position, 

^^itliout giving a fomaJ praoft it is claiined that this case can be handled 
without altering the definitional power ol the system® 

307 notation for transfer to the evaluator* 

Tlif^ ejffibol will be used rn place oi —> to indicate that the execution 

of a rule so designated will require a transfer of control temporarily to the 
evaluator before the scan-algorithm can restime its processing* The categories 
<value>, <sijccess value> miA ^default va]ue> will be allowed on the rightside 
of a rule marked with These thy:ee categories will represent strings of 

terminal symbols but the exact specification of what these strings contain lies 
outside the scope of this paper. This information is dependent upon the for¬ 
mal definition of the evaluator itself. However <success value> and 
<default value> are mutually exclusive in the sense that one or the other 
must equal e. With’this restriction in mind then the following holds: 

<¥alue> « <success valu8> <clefault valiie> , 


t To see that this is true, note that <x> can appear only a finite number 
of times on the rightside of die various context-free rules« The rule 
<x> £ can be deleted and its effect can be retained by adding a set 

of parallel rules to the grammaro These new rules are added by using the 
formula <x>«£ .and performing a uniform substitution throughout tha right- 
sides of the other rule."" in the original grammar« This expanded granmiar 
will define the same language mi the categexy <5i> will no Icn.gex generate 
£• The naming significance of <3c> will be unaltered except with respect 
to €# This same expensive substitution nust also be performed wicii respect 
to the tules of the type-^T grammar® 





• J 


. ::) 


4. T£?E- 

•T GPJliiiiF, FDR THE SCAM ALGORITHM OF THE TRAC LANGUAGE 



SyBtax 


Comments 

4.i 

D h 


•d n #(ps,//(ra)) 

n * » 

Re-initiallze to the Idling procedure. 

4.2/X 

n C <balancea string> ) 

• c’fcalaneed striBg> 0 

Remove protecting parens from a 
balanced string. 

4.2,2 

n <unbalSliced stri'ng> 


d n #(p3,#(rs)) d 

Re-initialize due to missing close paren. 

4.3 

0 <foma£- character> 


D 

Delate unprotected format character. 

4.4 

■ D ■#( 


( D 

Mark the open paren of an active function. 

4.5 

0 ##( 

—> 

( D 

Mark the open paren of a neutral function. 

4 s 6. i 

i 

0 #<# {^i(» 


#n <#{#!<» 

Step past a #, 

4.6,2 

Q ##<# (> 


# D #<?« C>. ■ 

Step pas-! ", 

4.7 

D e 


; D ■ 

Mark s cf5i3J4.i ^^nich separates arguments. 

4.S.1 

' { C'a^isuiaaat string> 0 ) 


n <value> 

Execute active function evaluation. 

4.s.2t 

“ “ 1 

(Carguaeut string> 0; } 


<eucca3a valueD D <defaalt value> 

Execute neutral function evaluation. 

4«S,3f 

: <&rgi<ment strlng> D ) 


d <argutaaat string> Q ■ 

Delete excess close paren. 

4-9 

0 <text character> 


<text charac£er> D j 

Step past & text character. 


t <sticcess veiuc> and sdefaalt \'alua> are mutually exclusive in the 

sense that , ■ 


one or tbe otlier must squsl 

r.j, the empty string. 



T See footnote on 2*8^3 for an alternate interpretation® 

NJ 

* 
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5. EXAMPLE OF SCRIPT PROCESSING 

In table la sequence of eight instantaneous descriptions (ID’s) are given 

which Illustrate configurations during the processing of a script in which both 

input and output occur. The following additional conventions are adopted: 

and [j- are end markers for the input string. 

* is the metacharacter for the ®'read string” function. 

and j-^ are the end markers for the printer string. 

Table II gives a detailed explanation of the processing which occurs between 

* 

each pair of instantaneous descripticx.s, S represents any string, 

ID Neutral/active string Input string Printer string 


1 


2 

no cliauge 

no change 


II 

If 

^ Yl'Tps7D##(rs))) 


11 


no change 

n 

6 Y^Tps7),(D)) 


ft 


BO change 



It 

no change 

Table I.Typical sequence of 

instantaneous descriptions (ID^s)® 

Processing Sequence of + 

Step ■ type-O rules 

Coiaments 


IDi*=^ID2 1 

Reinitialize, 


ID2^ID3 4,9,9,7,4,9,9 

Scan for close 

paren*. 

ID3=5^n)ij 8.1 

Execute active 

”read string” function. 

IDq-^IDs 5,9,9 

Scan for close 

paren* 

IDs^oIDg 8*2 

•Execute neutral 

”read string” function. 

IDg^oIDy 8.1 

Execute active 

”print string” fiinction« 

ID7==>ID3 8.3 

Delete excess c. 

lose paren entered into ID4 


^by the active "read string” function. 

The next siap is rale 4oi, 

t The initial 4. has been deleted from each of the rule numbers 
In this column. 

Table II. Details of the processing steps utilized In the. example of table I. 
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6, BOUNDED CAONTEXT ANALYSIS OF THE TYPE-T RULES 

Some of the rules of Section 4» require a scan over string of unbounded 
length to complete the evaluation of a rule. However such unbounded scans 
are not necessary to determine which rule of the set will succeed and hence 
is to be applied next. The detemtnistlc logic aacessary to achieve this 
decision requires at most the exoiaiuEtlO’Xs 3f three eharacterst to the right 
of the <scar pointer>, 1 euxiRarj of this analysis is given as an extended 
entry decision table In Table III. The corresponding actions appear in Table 
IVo This decision table is cci.^ettad into one possible flow chart in fig¬ 
ure 12„ The use of a horiaontai format for the rules in the decision table 
allows the three characters under scan to he viewed in. their natural pos¬ 
itions, i.e. horizontal and left to right. 

t This Is known as a hounded context, scan and in the notation of Knuth is 
LR(3). See [9] sections II.Cl and II,G3 for the required definitions. 





Conditions 


Rl 

R2 

R3 

* Actions 



V, 

1 

a 



2 

=<fc> 



3 


= ( 


4 



* ( 

5 




6 




6 



H 

6 

s 

$ 



7 




8 

*=<tc> 



9 


<fc>= <format character>, 

<tc>= <text character>, 

Rl, R2 and R3 stand for the first, second and third 
characters to the right of the scan pointer. 

Table III. Extended entry decision table for bounded context rules 
In horizontal format. 


Action 

Definition 

Rule 

1 

Re-“initialize to the idling procedure. 

4.1 

2 

Remove the protecting parens from a <balanced string> and 
place the scan pointer immediately to the right of that 
<balanced string>j if the parse discovers <unbalanced string> 



execute re-initialization. 

4.2 

3 

Delete unprotected <format character>. 

4*3 

4 

Mark the open paren of an active function and step past it 



while deleting the 0 , 

4.4 

5 

Mark the open paren of a neiitral function and step past it 

J 


while deleting the 00 , 

4.5 

6 

Step past a 0 which is not followed by ( or by #(. 

4.6 

7 

Mark a comma as an argument separator and step past it. 

4.7 

8 

Execute a leftward scan which will determine which of the 
following actions is to be taken: 



(a) Evaluate an active function. 

4.8.1 


(b) Evaluate a neutral function. 

4.8.2 


(c) Delete an excess close paren. 

4.8.3 

9 

Step past a <text character>. 

4.9 


Table IV. Details of actions referred to in Table III. 














30 


A P P E N D I X I 

Relationship Between The Original Algorithm And The New Non-procedural Algorithm. 

Let Rn (for n=l,..,,16) stand for the rules of [1], R15 can be eliminated 

by adding to the end of rules R3, R4, R7j R9, Rll, R12 and R13 the action 

specified by R15. In [1] immediately following R15 is the following statement: 

•'Extra close parens are ignored and are deleted 
at the end of a procedure." 

For the purpose of this appendix, let this statement be referred to as R16. 

In [1] there is Informally implied the presence of an auxiliary pushdown 
used to record and to retrieve pointers to the neutral string. Note in par¬ 
ticular the statements in R4, R5, R6 and R8 about a pointer to the "current 
location" and the statement in R8 about a pointer to the "current function"!. 
This implied auxiliary pushdown has been replaced by adding to the tape alph- 

I 

abet the marked characters ( and ^ and , . These marked characters can be 
retrieved by a leftward scan. The relationship between the two algorithms 
Is summarized in Table V. • 


f It was not the intent of [1] to specify the details of implementation for 
a TRAC language processor. 


New Rule. 

Context 

Old Rule 

Relation! 

2.1 

re-inltialize 

R1,R14 

D 

2.2 

DC 

R2 

I.l 

2.3 

D<format character> 

R3 

D 

2.4 

UH 

R5 

D 

2.5 

mi 

R6 

D 

2.6 

D# [not R5,R6] 

R7 

D 

2.7 

D. 

R4 

D 

2.8 

D) 

R8,R10,R13 

1.3 

2.8.1 

T ...D) ' 

Rll 

D 

2.8.2 

T ...D) 

R12 

1.3 

2.8.3 

-1 ...D) 

.. R16 

1.2 

2.9 

otherwise 

R9 

. D 

t D « 

I.x 

there exists an obvious and direct relation between 
the rules in the two systems. 

= see this appendix at Section I.x for comments on this 

case. 

Table V. 

. Relationship between new rules 

and old rules« 
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1.1 Balanced strings. 

In R2 no provision was made for the case where the expected close paren 
is omitted. Rule 2',2,2 makes explicit provision for this situation. 

1.2 Close paren. 

In rule 2,8a leftward scan for I or T or —I replaces the ^'retrieve pointer" 
operation of R8, In the non-procedural algorithm, the case of —I is hand¬ 
led by 2.8.3 while in |1] it is defined by R16, See also footnote to 2.8.3, 
in this paper, 

1.3 Neutral evaluation. 

The rule 2.8.2 is so stated as to take into account a special case which 

occurs when a neutral function returns a <default value>. This case 

is covered in [1] by the non-procedural statement; 

"The overflow value (in this paper called <default value> ) is 
always treated as if it were produced by an active function." 

This is found in [1] on page 218 under "Arithmetic Functions". 

new rule 2.8.2 provides for this case directly. 


The 
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